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ABSTRACT 


A brief history of the development of tables of random numbers is 
presented. This is followed by an investigation of the generation of 
uniformly distributed random numbers on the CDC 160); digital computer. 
Certain congruential relations are examined in order to determine 
parameters that will produce acceptable random number sequences. Ste- 
tistical tests of the generated numbers are described and the results 


of these tests are presented. 
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PREFACE 

Statistical sampling techniques using a "Monte Carlo" method can 
require a large supply of random numbers. Such techniques have been used 
in the evaluation of definite integrals, the solution of ordinary and 
partial differential equations, the solution of integral equations and 
matrix inversion, to mention only a few applications. On the other hand, 
some problems, described in terms of a stochastic model, can only be 
solved by recourse to sampling techniques, as for example, mathematical 
war gaming problems. 

When such models are programmed for a high speed digital computer, 
it is necessary to produce large numbers of random numbers quickly. The 
tables of random digits or random uniform numbers customarily used in 
hand calculations do not lend themselves to use in the high speed compu- 
ter. It is inefficient to read into the computer the large number of 
random digits required to meet the demand for such numbers arising in 
problems which are otherwise within the capabilities of the machine. 
Using the same numbers repeatedly in a given problem would introduce 
bias. 

It has been suggested that special equipment which utilizes a phys- 
ical random process be used to produce the random numbers. On the sur- 
face, this would seem desirable but, in practice, difficultics immedi- 
ately arise. First, the construction and maintenance of such a device 
would be quite expensive. To be useful, it would be required to produce 
a number in the order of a few microseconds for extended periods. Reli- 
ability requirements would necessitate its output to be continually 


checked for randomness. The second difficulty is that such a random 


ot ie 





device would not permit the recalculation of a problem using the same 
generated numbers. This could prove to be a handicap in checking on 
machine operations. 

These difficulties lead to the exploration of arithmetical schemes 
for generating numbers which take advantage of the computer's high speed 
and place minimum demands on computer memory storage. Such numbers 
could not be considered truly random; hence, the adoption of the term 
"pseudo-random numbers". 

It is the intent of this paper to present a brief historical devel- 
opment of random number generation and to explore, in particular, the 
multiplicative congruential type of generator for adaptation to the 


CDC 160). 
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I. INTRODUCTION 
1. Kendall and Smith's Random Sampling Numbers. 

It might be supposed that if one wanted to obtain a sequence of 
random digits that almost any method that employed some sort of chaotic 
selection would suffice. Observing the indications of a roulette wheel 
or, perhaps, an unsystematic selection of digits from a large telephone 
directory might seem like "reasonable" methods. Once the digits were 
collected, however, a problem would arise. By what criterion could 
their randomness be judged? 

It was to this problem that M. G. Kendall and B. Babington Smith 
addressed themselves in their now classic work, "Randomness and Random 

i 
Sampling Numbers". 1 : They start their paper with a discussion on 
the relation between randomness and probability. 

In colloquial speech the word "random" is applied to any method 
of choice which lacks aim or purpose; and this usage is also found 
in certain sciences. In statistics, however, the word has a some- 
what different and more definite significance, closely related to 
probability. It appears, in fact, that for statistical purposes 
the ideas of randomness and probability are inseparable, whether 
one belongs to the "intuitive" school which regards probability as 
an undefinable, or to the opposing "frequency" school which seeks 
to explain it in terms of statistical frequencies. 

Having linked the concepts of probability and randomness, Kendall 
and Smith concede that there are many controversial areas that pose 
abstract problems that are,--~ "verging, at times, on the theological." 
In the interest of getting on with the more mundane problems of Random 
sampling Numbers, they state: 

We take randomness and probability to be undefined ideas obeying 


certain intuitively formulated principles, which will be found suf- 
ficient to give results of practical application. 


Numbers in brackets refer to references cited in the Ribliography at 
the end of the paper. 





The authors note that any combination of digits could be considered 
a random set, in that all combinations are possible when drawing from an 
infinite universe of digits. However, not all combinations are practical 
for random sampling; for example, a block of digits which are all the 
same. For purposes of sampling, a definition of Random Sampling Number 
is made: a set of numbers which can be used for random sampling, not 
necessarily a set obtained by random methods. It is Kendall and Smith's 
intention to produce a table of random digits and to test these for 
acceptability as Random Sampling Numbers. That is, Random Sampling Num- 
bers is a collective term for a series of random digits. 

Such a set of Random Sampling Numbers should conform to the notion 
of relative frequency. That is, if samples were taken from a universe 
consisting of an infinite number of digits such that each of the ten 
digits 90 through 9 may be expected to appear with the same frequency, 
then, it is expected that in a large number of trials each of the digits 
will appear approximately an equal number of times. Further, it would 
be expected that each pair of digits would occur approximately the same 
number of times, similarly triples, etc.; that is, no digit would tend 
to follow another digit in any consistent pattern. 

Here then is an important distinction: any given set of N numbers 
need not follow such expectations but a set of Random Sampling Numbers 
Should do so. Hereafter, we shall identify such a set of Random Samp- 
ling Numbers as a random set. 

In order to ascertain whether a given set of digits is a good enough 
"approximation" to a random set, the observed frequencies may be compared 


to the theoretical frequencies by means of a chi-square test. On "purely 





arbitrary grounds", Kendall and Smith elected to say that a set of num- 
bers is acceptable if the resulting chi-square statistic is within the 
range of 0.01 = P = 0.99, where, with 0; the observed and E; the 
expected frequencies of the jth digit, 
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with 9 being the number of degrees of freedom. 


Having thus determined the characteristics of this set, the authors 
then proceed to establish a series of four tests for a set of digits to 
be considered as a random set. Subsequently, these tests have become 
standard in the literature on random numbers. Briefly, they are as fol- 
lows: 

(a) The frequency test to determine if all the digits of a set 

occur in approximately equal amounts. 

(bo) The serial test which tests to see if any digit tends to 
follow another. 

(c) The poker test which is named after the card game and examines 
the digits in blocks of five for five of a kind, four of a kind 
etc. The frequency of these hands are then compared with the 
expected distribution of such hands. 
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(d) The gap test which inquires into the gaps occurring between the 
same digits in a series and compares this with the theoretical 
distribution. 

Although these tests are not sufficient, the authors feel that collec- 
tively they are powerful. 

With the foregoing criteria established, Kendall and Smith attempt 
to set up a random series by selecting digits from the London Telephone 
Directory ina haphazard fashion. Even though they took what would seem 
like extra precautions to avoid bias, the results of their tests indi- 
cated significant bias. 

They then constructed an electro-mechanical device to produce random 
digits. Essentially, it consisted of a wheel divided into ten equal seg- 
ments with a digit assigned to each segment. It was driven by an elec- 
tric motor whose speed was accurately controlled. The operator on a 
separate circuit controlled a neon lamp whose flash was of such a short 
duration that the moving wheel appeared to be stationary. The number 
that appeared under a previously fixed pointer was recorded as the ran- 
dom digit. The actuation of the lamp was kept haphazard by having the 
operator move a stylus over an intricate maze of circuitry which, it was 
maintained, caused the light to operate in a random fashion. 

A table of 100,000 digits was produced in this fashion and, based 


on their tests, were considered acceptable as Random Sampling Numbers { 1]. 


¢. RAND Corporation's Table of Random Digits. 
Kendall and Smith's table appears to have been satisfactory for a 
number of years; at least, there were no other tables published. In 


1947, a group of research workers in the RAND Corporation responded to a 
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growing need for random numbers by producing a table of one million ran- 
dom digits 2 | . These were needed to solve various computerized 
research problems by experimental probability procedures. Such proce- 
dures were called by the colorful name, Monte Carlo Methods. For many 
of the problems that RAND was working on, the need for numbers far 
exceeded those of Kendall and Smith's Tables. In fact, it meant that 
these tables would have to be used time and again in a particular prob- 
lem. This presented the consequent danger of introducing unwanted cor- 
relations. Such a growing thirst for random numbers was, of course, due 
to new problems and rapidly developing computer technology available for 
solving these problems. 

RAND'S table was produced by electronic-mechanical means and was 
apparently more sophisticated than Kendall and Smith's machine. 

In principle the machine was a 32-place roulette wheel which made, 

on the average, about 3,000 revolutions per trial and produced 

one number per second. 

It appears that the engineers designing the device did have many 
problems because the original machine showed statistically significant 
‘biases and had to be modified extensively. Further, even though elec- 
tronic checks were continually made, after a month the results of tests 
of a block of numbers produced showed significant bias. This indicated 
that the machine was running down even though the electronic checks were 


acceptable. 


3. Pseudo-random Number Generation. 
While RAND'S table was suitable for work with punched card compu- 
ters, it proved impractical with the advent of high speed electronic 


computers because: 





(a) it placed severe storage requirements on the machine 

(b) reading in the tables was slow 

(c) much larger tables were required for the solution of certain 

types of problems, 

These shortcomings led to an interest in methods for producing 
"pseudo-random" numbers by arithmetic means. A definition of this tern 
was given by D. H. Lehmer as: 

A pseudo-random sequence is a vague notion embodying the idea of 

a sequence in which each term is unpredictable to the uninitiated 

and whose digits pass a certain number of tests traditional with 

statisticians and depending somewhat on the uses to which the 
sequence is to be put. 

One of the first methods used was proposed by von Neumann and Metro- 
polis wherein an arbitrary n-digit number, Yo > was selected. This num- 
ber was then squared and Yy was produced as an n-digit number from the 


2 
center of the @ n-digits of Y) . The process was then repeated, produc- 


ing Y3,) from Y;. 


Although this process generated pseudo-random numbers, its period 
was not predictable. Further, it was demonstrated by D. H. Lehmer 3] 
that, in general, it would not produce a very long cycle before the 
process degenerated into a sequence of zeros. Consequently, it could 
not be recommended as a source of preat quantities of random numbers. 

During the second symposium at the Harvard Computation Laboratory, 
D. H. Lehmer suggested 3] that the multiplicative congruential rela- 


tion (See Appendix A) 


iv = LY (mod M) (1-1) 


r+] r 


could be used where the least positive residue is taken as the generated 





number. In effect, this statement says that (Y LY.) /M is an 


an 
integer. The modulus M could be selected so as to be compatible with 
the base of the machine being used; that is, it could be of the form 
(2P) or (10"), P normally is selected to be the mnmereeta positions 
available in a word of the computer being used. Further, in a binary 
machine, with a proper selection of L, the period of this generator is 
Bee?) 

Using P in this way avoids the necessity for performing the final 
division operation in most high speed computers. For example, in the 


CDC 160) using a modulus of oh 


» the multiplication (MUI ) of the number 
L by Y, positions the product in the QA registers. The 7 low-order 
bits of this product in the A register are, in fact, the desired resid- 
ual or the newly generated random number. ; 

Although such a method is relatively fast, nevertheless, it does 


use the multiplication command which, for a computer, is time consuming. 


Greenberger | proposed that L in equation (1-1) be modified to 


ee oa) X. (mod 2°) (1-2) 


with A = 3. By doing this, the multiplication could be performed by a 
shift command and 3 add commands which represented a considerable saving 
in time over the straight multiplication instruction. Further, he shows 
B that an L of this form does satisfy the conditions necessary to 

insure the full period of uae 

Another variation of equation (1-1) was proposed by Rotenberg (6! 
whose generator became 
*) 


Yp+ 


m2 +1) v+c (mod 2 (1-3) 





This again was faster than Greenberger's model by one add instruc- 
tion but, unlike the former, this gave a full period of oi = 2 
and C an odd integer. 

Equations (1-2) and (1-3) will be investigated in Chapters II and 


III for use as pseudo-random number generators for the CDC 160 computer. 





II. EMPIRICAL TEST PROCEDURES 
1. General 

Previous studies made using other machines 5 | lo] I? indicated 
that the quality of the generated pseudo-random numbers using equations 
(1-2) and (1-3) can be very sensitive to the selection of A. Aside from 
a guarantee of the period length (See Appendix A), there is, presently, 
little mathematical theory to aid in the selection of parameters that will 
enhance the random attributes of the generated samples. The task is 
largely an empirical one wherein an A and a starting value, Y,, are se- 
lected and the subsequent output is submitted to a battery of tests. 
Such empirical analysis can only give general indications as to desirable 
parameters because it is only practical to investigate a small portion 
of the total cycle of the generator. This fact can be appreciated by 
observing that using a modulus of ohT in (1-2) the fraction of the full 
period that would be considered, if even, say 10” elements were Lesitved, 
is approximately 107°. 

As has been cited in Chapter I, the tests for randomness offered by 
Kendall and Smith were designed to check on the randomness of the indi- 
vidual digits of their table of random numbers. The generators under 
discussion here, however, produce pseudo-random numbers. Consequently, 
those tests that lend themselves to investigating certain properties 
associated with random numbers rather than those associated with indi- 
vidual digits are favored in the investigative work herein reported. 
Should a requirement arise for random digits when using equations (1-2) 
and (1-3), only the high order (left-most) bit positions of the binary 
representation of the generated number should be considered. (See 


Appendix A ) e 





2. Uniformly Distributed Random Variables. 

Many uses of computerized sequences of random numbers require that 
they be uniform (0, | . Not only is this distribution used in its own 
right but numbers from this distribution may be used to produce numbers 
with other distributions. Converting the generated pseudo-random numbers 
of equations (1-2) and (1-3) to uniform [0,2 is quickly accomplished in 
a high speed digital computer as it merely requires the positioning of a 
decimal point to the left of the generated numbers. (See Appendix B.) 

3. Description of Tests. 

The following tests were performed on the blocks of numbers forming 
a generated sequence. 

(a) Frequency test: The classic chi-square goodness of fit test 
was used as a frequency test /9| » This test divides the unit interval 
into k equal parts. The generated numbers were each checked and a tally 
kept on their frequency distribution within the k intervals. This is, 
perhaps, the most frequently applied of the random tests. 

Attempts have been made to establish some sort of an optimal size 
for k. Mann and Wald, in their paper [8 | » present a procedure by 
which the lengths of the class intervals are determined so that the prob- 
ability of each class under the null hypothesis is equal to 1/k where k 
is the number of class intervals. They establish a Theorem which says 


that, in the limit, the best (defined by them) value of k is given by 


‘ 1/5 
ac 
where C is determined so that 
1 : =s 
e € ax 
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is equal to the size of the critical region (probability of the critical 
region under the null hypothesis} and Ni is the number of random elements 
generated. For significance levels of .01 and .05 the corresponding 
values of C are 2.327 and 1.645 respectively. These, in turn, lead to 

= 88 for .Ol and k= 1] for .05 with N 5000. A k = 100 was used in 
the large sample tests reported on in Chapter III. 

(bo) Serial Correlation: This test was set up following the formula 
given by Kendall 9) « Lags from 1 through 10 were examined. 

(c) Run test: The nunber of runs up and down were examined and com- 
pared with the expected theoretical distribution using a chi-square test. 
The first manbver to be examined is compared with the next number in tne 
sequence, If the second is larger than the first, a run up of size one 
has occurred but is not yet recorded. The second and third numbers are 
then examined in the same way. If the third is greater than the second, 
a run of size two has occurred but is not yet recorded, If the third is 
smaller thar the second, a run of size one is recorded etc, 0] ‘ 

(d) Moments: The an four central moments were computed and con- 
yared with those for the uniform Ay] distribution. 

(e) Poker Test: This test was used to examine the sequence of 
digits within a number. Starting at the hignest order bit position, the 
digits were grouped to give five octal digits (15 bit positions). The 
noker hand value of these five digits was tallied. The next 15 bit 
positions of the same random number were similarly examined and tallied. 
The succeeding generated numbers were examined in a similar fashion. A 


chi-square test was performed on the results. 
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4. Small Sample Tests. 

Exploratory tests were made on generators (1-2) and (1-3) using two 
different starting values, Y, = 1 and Yy = ao = i, ample sazes of 
109 were taken at each: A = 5,10,15....,45. This was done in order to 
ascertain, ina general way, the sensitivity of the generators to 
changes in A. The results presented in Tables 1 - 8 of Chapter III 
indicate that for these particular starting values the selection of A 
Poeeriiical. ay further confirmed that a good selection of A was approx- 
imately ( pel for both generators. This supp -rted the rule of thumb 
that was offered by Greenberger B ; 

Selection of Yo = 1 also gives an indication of how the generator 
would respond when faced with a low number. Cases have been reported 
I? where, when this occurs, an unfortunate selection of A will make 
the generator extremely sluggish and can produce several hundred low 
numbers before yielding a more chaotic output. This succession of low 
numbers gives, in effect, a sequence all below 0.5. Such a character- 
weulc, Obviously, is undesired. 


Parallel procedures were followed using a sample size of 19990 nun- 


bers at each A. 


S. Large Sample Tests. 
Having selected a value of A from the results of the small sample 
tests, five blocks of 10,000 numbers were generated and subjected to the 


battery of tests. 


6. Modifications. 


Experiments were conducted to investigate the quality of pseudo- 


random numbers generated by addressing the output of one generator to a 
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second one having a different value of A. This linking process for 


equation (1-2) and (1-3) can be described as follows: 


= A } 

eee 3) eT (mod gly 

= Clee 1) ¥,41 (moa 247) G1) 
where ante = eens seseeeee|becomes the sequence of interest. 
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JiI. RESULTS AND CONCLUSTONS 
1. Interpretation of Data. 

No attempt was made to select the best parameter A based on the 
small sample tests discussed in Chapter II. Rather, the observed data 
was prepared in tabular form for each of the generators. This allows 
general comparisons to be made on the effects of changing the inputs on 
each of the generators being investigated. In this way, certain ranges 
of parameters are observed to produce small samples whose performance is 
consistently poor under the battery of tests. It also permits a compari- 
son of one generator with the other as A is varied. For if one assumes, 
as will be done in this paper, that the generator that exhibits the 

least number of these "poor" indications is the preferred generator 
such a tabulation will make it epparent. This assu: p:ion does not take 
into account the differences in speed of generation between (1-2) and 
(1-3). In the CDC 160), this difference is one add instruction or 7.2 
micro-seconds per word and although this is a consideration, for pur- 
poses of this analysis, it will be considered minor. The same can be 
said of the differences in cycle length between the generators under 
consideration because each of them is extremely large. In short, the 
point of focus in the discussion to follow is to be centered on the gen- 
erators ability to produce pseudo-random numbers that are of practical 
use. 

A particular parameter was considered "poor" if it failed to pass 
anyone of the following arbitrarily selected criteria: 


(a) The frequency chi-square statistic, Xo° must be 


Bo ea Xoe = 16.9 that is, for 9 degrees of freedom the 


Wy 





critical values of the chi-square were selected as .95 and .95. 
(b) No long runs; where a long run will be considered a run of 
length six or greater for samples of size 100 and of length seven 
or greater in samples of size 1000. 
(c) Serial correlation coefficients of lag one must be less than 0.2 
for samples of size 100 and 0.05 for samples of size 1099. 
(d) Mean greater than or equal to 0.430 and less than or equal to 


0.570% 


2. Results of Small Sample Tests. 
Tables 1 through 8 indicate the results of varying A in the gener- 


avors 


ea = (2aA4 oe (mod o47 (3-1) 


and Yel = | of 4 1) as (mod oT) (3-2) 


In tables 1 through h Y 1 while in tables 5 through 8 cetera 
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In applying the criterion of paragraph (1), the number of values 
of A reflecting a "poor" sample in the case of (3-2) is greater than 
for (3-1). Generator (3-2) is very susceptible to long runs and, above 
an A of 25, cannot be considered a practical random number generator. 
Using different starting values did not improve the output of (3-2); 


= oh 


for Y>y = 1 only an A = 21 passed and for Yo - 1 no value of A 
passed. Even relaxing the criteria so that two failures are allowed 
before an A is considered "poor" only admits A = 18 and A = 20 as being 
acceptable for both starting values. Further, using a value of 

Cc = (.788) oh? as suggested by Coveyou ino did not make any notice- 


able improvement in (3-2). 


Table 9 shows the acceptable A's for generator (3-1). 


TABLE 9 
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The above test results indicate that for these criteria generator (3-1) 
is much less erratic than (3-2). Six different values of A, 
( A = 10,19,21,22,35,0), are acceptable using both starting values and 
therefore it is suggested that A be chosen from these values. 

Some tests were also made using the approach suggested by equation 
(2-1). Aside from some improvements in the poker tests results, no 
noticeable improvements were made. Using this approach increases the 


time of generation per word and unless the results registered a 
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significant improvement, it could not be economically justified. 


3. Results of Large Sample Tests. 

Using the findings of paragraph (2) above, an \ = 19 was selected 
for the five large samples to be tested. This particular value was 
selected over the faster A = 10 because it appeared to perform slightly 
better than A = 10 in the poker tests. The results are presented in 
Tables 10 through 17 and indicate that the samples satisfactorily pass 
the battery of tests. Figure 1 .shows the average serial correlation of 
the five samples for lags one to ten. wer these lags the correlations 


ranged from +0,0152 to -9,9201. 


. Remarks on Computer Work. 

All of the tests used in this paper were written using FOXTRAN 
language. These have been catalogued and are available at the Computer 
Center at the United States Naval Postgraduate School. Programs using 
an A = 19 in Equation (3-1) have been written both for Scrap and for 
Fortran and are available in the computer library. Appendix B lists 
the machine language steps required to produce a random number accord- 
ing to (3-1) and the additional steps required to convert it toa 
floating point fraction between zero and one. These steps have been 
included to allow a programmer a little more flexibility in using the 
generator. In many cases, it may be easier to include these steps 


directly in a routine rather than go to a computer library subroutine. 


5. Conclusions. 
(1) To determine 2 suitable value of A in (3-1) the methods used 
in small sample testing are useful. Further refinements could be 


made by 
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TABLE 1O 


FREQUENCY TEST 
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TABLE 11 
RUN TEST 
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TABLE 12 


RUN TEOT CHI-SQUARE RESULTS 
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TABLE 13 


POKER TEST (FIRST FIVE OCTAL DIGITS) 
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TABLE 1) 


POKER TEST CHI-SQUARE RESULTS (FIRST FIVE OCTAL DIGITS) 
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TABLE 15 


CHARACTERISTICS OF GENERATED UNIFORM DISTRIBUTION 
(FIVE SAMPLES OF 10,000) 
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tN A EAE gO REED tN TON A NY RATERS I) Se OS Ca tM Mg 


.00029 .00073 .0ood1k .00002 .90001 0.900990 


AAT eRe 2 Aan TD amen eee Ne nr a pag OE me Se ee rah ORR RR Arne FL Nee NS RN LE A OC Ont TNNITT w s t 2 er eOR ee ae wre pimempie _-s 


601255 .91288 «= «.012h6 = «01238 ~=— .012781 =| «0.01250 


4 


I PE eo Sey Sg Bm Oe rr ms ee Ge ee er re ee ene mee 








SKEWNESS 


KURTOSIS 
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TABLE 16 


POKER TEST (SECOND FIVE OCTAL DIGIT POSITIONS) 


ne ee 














| POXER 
#1 #2 #3 #hy 
2055 2053 2065 1997 2961 
S118 Sth «= 5156 SUB 5228 
S39 1560 1520 1561 1535 
1029 1008 oll 1032 «995 
168 = 192—iaHC“(<«i‘iSC«dN 
FOUR OF A KIND 89 12 75 101 89 
sit a = 


a OF A-KIND 





Ee ee 


*Pour and Five of a Kind combined for 5 d. of f. 


TABLE 17 


POKER TEST 








ly 
4.85 
| 
| PROBABILITY | —99 (.98-.95) (.95-.90) (.50-.30) 


| INTERVAL = | 





SS I + _— ee ee cee 
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| axp, 
em NUMBER 
i cl 





| 
| 
| 


eee ee | 


| 


2050. “ 


—~l- 


| ie 5126, 95 
=. 
| 1538. 09 
i 
: 1025. 39 
yO 170. 90 
85.45 
2 ily 


CHI-SQUARE RESULTS (SECOND FIVE OCTAL DIGIT POSITIONS) 


#5 


ES 





— 


(90-80) 





—- 





| 
7 


ea 


(a) selecting a few more starting values, Yj, and tabulating 
the results as in Tables 1 through l. 

(b) modifying the criterion used in the sifting process to 
meet an individual's unique demands. For example, if he 
desired an unbiased sequence of uniforms, he could tighten 
the restriction on the acceptability limits of the mean. 

(2) Based on the test results, Equation (3-1) is to be preferred 
to (3-2) as a generator of random numbers. The speed advantage in (3-2) 
of about 7.2 micro-seconds per word is masked by its demonstrated ten- 
dency toward long runs and poor frequency distributions for most 
values of A. 

(3) Five blocks of 10,000 numbers each were tested using gener- 
ator (3-1) with A = 19. All tests were passed satisfactorily and it 
is concluded that 
eS (3-3) 


ue = ( ol? . 3) 1a (mod 2 


r+ 
will perform as a pseudo-random generator for the CDC 160k. 

(4) The chaining of generators as suggested in E.,wation (2-1) did 
not lead to any positive indications that a better sequence of pseudo- 


random numbers would result. Unless this can be shown, the additional 


time of the generation is not warranted, 
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APPENDIX A 
USGNBRATOR CHARACTSRISTICS 
i. General. 

In 1949, Lehwer 3 Suggested using the multiplicative congruen- 
tial method for generating pseudo-random numbers on a digital computer. 
Since tnat time, this method has been used, tested and reported on by 2 
number of investigations, for example 12 13 . In using this 
method on a binary computer of word size P, the recurrence relation most 
frequently takes the form 


ee eee (mod 2°) (en) 


where L is a fixed odd integer and 0<Y,, =2°_1 being relatively prime 
to 2°, This sequence is periodic, its length depending on the choice of 
L, Y, and Pp. When the indicated iteration is performed, say n times, 
(1-1) becomes: 


n P 
a > Ci ae (mod 2°) 


and for this reason is sometimes referred to as the power residue method. 
The numbers tr/5P (r=9,1,2.... ) become the desired uniformly dis- 

ane 
tributed elements from the unit interval. Barnett 1h in 1961 showed 


that necessary and sufficient conditions to obtain a maximum cycle of 


P~2 


2 elements, all of which are distinct, are: 
Pei = 3 Gnod @) or IT = 5. (mod 8) (A-2) 
(ob) Y,= 1 (mod 2), hie) 
that is, Y, must be odd. 


If these conditions are not met the cycle length can be 2ffected 


and in some cases the process degenerates to zero. 
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2. Discussion of Equation (1-2). 

While the sequence produced by (1-1) has been found satisfactory, 
nevertheless, it does imply the use of the multiplication instruction in 
the computer. Relatively speaking, this is a time-consuming operation 
and interest continued on methods that would avoid its use and still pro- 
duce the desired sequence. Greenberger " proposed using an L of the 
form (2% + 3) where 2 = A <P. Itis seen that such an L satisfies 
the conditions necessary to insure a cycle of ag This form has the 
advantage of substituting a shift operation of size A and tnree add 
instructions for the multiplication operation. In the CDC 160h, the 
multiplication instruction as requires 25.2 plus .8n micro-sounds; 
where nis the number of ones in the multiplier. %n the other hand, the 
shift instruction takes 2.8 plus .4s micro-sounds, where s is the number 
of places shifted and an add instruction takes 7.2 micro-sounds. In 
operations where large quantities of random numbers are required this 
saving can make a significant contribution toward reducing machine time. 

When this form of Lis translated to the CDC 169h, equation (1-1) 


becomes; 


oe ames 9) 1, (mod 247) (A-2) 


where with 2 = A < h7 the period of the sequence is gue that is, 

95 distinct numbers are produced before the sequence repeats. iacn of 

the numbers produced are odd and belon:s to one of two mutually exclu- 

Sive integer sets eecnaing on the selection of Y, where, if r belongs 

to one set r+2 cannot. These two sets between them exhaust all of the 
7 


possible odd numbers in the interval 1 to 2. Because the numbers are 


odd this means that the two least significant bit positions in the 


binary representation of a number will not change, that is, their cycle 
is zero. The third bit position has period ql the fourth 2° and so on. 
For this reason, if random digits are to be used only the most signifi- 
cant digits of a number should be considered. 

No mathematical rules have been developed to accurately determine a 
particular choice of A although it is clear that an unfortunate selection 
here can result in an unacceptable generator. Coveyou fel did offer 
a criterion for selecting A in terms of an approximation formula he 
derived for the reduction of serial correlation between the numbers. 
Subsequently, Greenberger [5 introduced a correction term to be 
applied to Coveyou's expression and demonstrated that even though serial 
correlation might be optimized, nonetheless, the generator might suffer 


from other shortcomings, such as producing long sequences of low numbers. 


3. Discussion of Equation (1-3) 
7 
Rotenberg 16] tested a sequence 


nS Eee (mod 2") (A-l)) 


where L and C are odd integers less than P and O=yYr=2?. In this 
case, a maximum period of 2P is obtained for any 

L=1 (mod h) (A-5) 
An L of the form (2h + 1) with A=2 satisfies the condition of (A-5) and 
was used by Rotenberg in his tests. He examined two values of (C, 
C = 1 and C = (.788) oo the latter value resulted from Coveyou's approxi- 
mation mentioned above. Rotenberg reported that there were no s: gnificant 


differences in his results using either one of these C values. 
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For the CDC 160, this generator becomes 
= (oA 47 
eee 1) YL. + CO (mod 27’ ) (A-6) 


This type has an edge on form (A-2) in that it requires one less add 
instructor for each new number. It also has a longer period although 
this does not seem too significant in that (1-6) will produce alk5 
distinct elements. 

hh. Sub Periods. 

It has been pointed out by Peach 16 that in the case of (A-6) 
certain harmonics exist throughout its period. These tend, he feels, 
to lend a greater stability to the generated numbers than should be ex- 
vected of a random sample. He illustrates his point by making frequency 
tests on large samples and notes that they appear to be unduly uniforn, 
although his results do not overwhelmingly indicate this. ‘whether too 
much stability is objectionable depends on the particular application. 
If a user wanted to estimate an unknown mean without bias then this 
characteristic might not be critical. If the variance is a point of 
interest too much stability could be critical. 

5. An Example. 
To illustrate the multiplicative congruential method consider 


equations (A-li) which with L=5, Y,=0, C = 5, P = 3 becomes 


rao (%,, + 1) (mod 27) (A-7) 


The sequence is determined as follows. 


Substituting Y, = 9 into equation (A-7) gives 


Yy =—5 (mod 23) 


of 





This means that Y, is the remainder after dividing 5 by 2 This gives 


¥z = 5. Substituting this back into (A-?) gives: 
eee 51 6 | (mod 3) 
ae ie ( / MmOGr< / 
invading 30 by 23 fives a remainder 6. Therefore %5 = 6. This is 


repeated giving a sequence ¥, = 5, Yo = 06, Y3=3, Yh, Ye = 1, 


¥6=2, Yo =7, Yes O. of period 2°, 


we 
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APPENDIX B 
PROGRAMMING TH# GENERATOR 
Method 


1. The Relation, 


= aie hi? 


feed ( ) 


4 


) Yp (mod 2 


NiGzeen 
Yo=2 


can be programmed for the CDC 160i as follows: 


(a) Define: MASK = 140000000090999090 Binary 
MASK2 = 20999909990909990 Binary 


y, = 26 


ial 
MACHINE LANGUAGE ROUTINE: 
ENG (0). 
LDA (X,) 
LLS (19) 
SCL (ASK) 
ADD (X,) 
ADD (X,) 
ADD (X.) 


Su eee 


(b) At this point, the random number is in the accumulator and has 


been stored in X, in preparation for the next number if required. 


is now required tc convert it from its present integer form to a flceat- 


ing point fraction on the unit interval. This is done by continuing on 


with the following instructions: 





ARS (11) 
ADD (MASK2) 
FAD (MASK2) 
iiemenas bime, the desired uniformly distributed 9,1] mumber 15 an 
the A- register. The time of generation is about 90 micro-seconds per 


number. 


ho 


a 











PFits is 56897 


B238& Barron 
aaa Random number gener- 


ation on the CDC 1604. 





